Distinguished Paper: Automatic Local Memory Architecture Generation for Data Reuse in Custom Data Paths
نویسندگان
چکیده
Traditional high level synthesis is able to yield high computational resource utilisation and short critical paths. The shortcomings of the generated designs usually lies in the memory architecture. To achieve good performance on a FPGA, the data must reside in the fast on-chip memories, but these are commonly too small for the data being processed. Traditional high level synthesis cannot cope with this situation. In this paper we present a technique for automatic generation of a memory architecture, data paths and associated controllers from a high level language such as C. Data reused during the processing are stored in a local memory, resulting in high performance even when the data are stored in shared off-chip memory. The technique is based on data dependence and data access pattern analysis. Commonly used data are duplicated in on-chip memory. High memory efficiency is achieved by rearranging the data memory layout during copying. We have applied our technique to typical signal analysis tasks. The results show that the data path does not need to stall waiting for data, even when all data are stored in a shared off-chip memory. The experiments have been carried out on a Xilinx Virtex2 FPGA.
منابع مشابه
Optimizing Cost Function in Imperialist Competitive Algorithm for Path Coverage Problem in Software Testing
Search-based optimization methods have been used for software engineering activities such as software testing. In the field of software testing, search-based test data generation refers to application of meta-heuristic optimization methods to generate test data that cover the code space of a program. Automatic test data generation that can cover all the paths of software is known as a major cha...
متن کاملAutomatic Data Path Generation from C code for Custom Processors
The stringent performance constraints and short time to market of modern digital systems require automatic methods for design of high performance applicationspecific architectures. This paper presents a novel algorithm for automatic generation of custom pipelined data path for a given application from its C code. The data path optimization targets both resource utilization and performance. The ...
متن کاملData-Reuse and Parallel Embedded Architectures for Low-Power, Real-Time Multimedia Applications
Exploitation of data re-use in combination with the use of custom memory hierarchy that exploits the temporal locality of data accesses may introduce significant power savings, especially for dataintensive applications. The effect of the data-reuse decisions on the power dissipation but also on area and performance of multimedia applications realized on multiple embedded cores is explored. The ...
متن کاملExploiting Data Reuse in Modern Fpgas: Opportunities and Challenges for Compilers
Current high-end Field-Programmable-Gate-Array (FPGA) parts offer a large number of configurable resources. These can be organized in custom storage structures such as tapped-delay lines, in addition to a number of very dense highcapacity Random-Access-Memory (RAM) and Content-Addressable-Memory (CAM) blocks. The extreme flexibility of the size, organization and interconnection between these st...
متن کاملOn the data reuse and memory bandwidth analysis for full-search block-matching VLSI architecture
This work explores the data reuse properties of fullsearch block-matching (FSBM) for motion estimation (ME) and associated architecture designs, as well as memory bandwidth requirements. Memory bandwidth in high-quality video is a major bottleneck to designing an implementable architecture because of large frame size and search range. First, memory bandwidth in ME is analyzed and the problem is...
متن کامل